Rank | Count | Beginning |
---|---|---|
8 | 788 | В |
32 | 387 | 09 |
22 | 374 | По |
15 | 181 | Как |
68 | 153 | Но |
397 | 150 | На |
190 | 131 | А |
78 | 126 | И |
6 | 112 | Он |
39 | 110 | При |
30 | 102 | Кроме |
54 | 99 | Это |
38 | 95 | Однако |
9 | 76 | Об |
120 | 61 | Если |
82 | 60 | Для |
67 | 58 | С |
325 | 56 | Напомним, |
197 | 54 | Я |
61 | 51 | У |
324 | 47 | После |
24 | 45 | К |
121 | 45 | Мы |
27 | 43 | Они |
14 | 42 | За |
29 | 40 | Ранее |
17 | 39 | Так, |
12 | 38 | 2009 |
585 | 38 | Во |
126 | 37 | Поэтому |
In the next four subsections show the most frequent sentence beginnings consisting of N words, N=1, 2, 3, 4. In this subsection we start with N=1.
The most frequent word-N-grams at the beginning of sentences give some insight into sentence composition.
Especially for N=1, we only need a small corpus to identify the most frequent sentence beginnings.
select substring_index(sentence, ' ', 1) as beg, count(*) as cnt from sentences group by substring_index(sentence, ' ', 1) order by cnt desc limit 50;
4.3.1.2 Most Frequent Sentence Beginnings II
4.3.1.3 Most Frequent Sentence Beginnings III
4.3.1.4 Most Frequent Sentence Beginnings IV
4.3.1.1 Most Frequent Sentence Endings I
4.3.1.2 Most Frequent Sentence Endings II
4.3.1.3 Most Frequent Sentence Endings III
4.3.1.4 Most Frequent Sentence Endings IV